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Abstract 

These informal notes deal with some basic properties of metric spaces, 
\^ | especially concerning lengths of curves. 
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1 Definitions and notation 

A metric space is a nonempty set M with a distance function d(x, y) defined 
for every i,!/ £ M. More precisely, d{x 1 y) is supposed to be a nonnegative real 
number which is equal to if and only if x = y, which is symmetric in x and y 
in the sense that 

(1.1) d(y,x) = d(x,y), 
and which satisfies the triangle inequality 

(1.2) d(x,z) <d(x,y)+d(y,z) 

for every x,y,z G M. For example, the discrete metric is defined by putting 
d(x,y) = 1 when x ^ y, and it is easy to see that this satisfies the preceding 
conditions. 

A more interesting example is the real line R with the standard metric. If 
a: is a real number, remember that the absolute value \x\ is defined by |x| — x 
when x > and \x\ = —x when x < 0. It is well known and easy to check that 

(1.3) \x + y\<\x\ + \y\ 

for every x, y E R, and the standard metric on R is defined by 

(1.4) d{x,y) = \x-y\. 

If (M,d(x,y)) is any metric space and E is a nonempty subset of M, then the 
restriction of d(x,y) to x,y E E defines a metric on E, so that E becomes a 
metric space too. 

Let (M, d(x, y)) be a metric space. The open ball B(x, r) with center x E M 
and radius r > is defined by 

(1.5) B(x,r) = {y E M : d(x,y) < r}. 

Similarly, the closed ball B(x,r) with center x € M and radius r > is defined 

by 

(1.6) B(x, r) = {y e M : d(x, y) < r}. 

Thus B(x, r) contains only x when r = 0. 

A set E C M is said to be bounded if there is a point p £ M and a nonneg- 
ative real number t such that 

(1.7) d(p,x)<t 

for every x E E. By the triangle inequality, this implies that 

(1.8) d(q, x) < d(p, q) + t 

for any q E M and x E E. One can use this to check that the union of finitely 
many bounded sets is bounded. 
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2 Norms on R n 

Fix a positive integer n, and let R" be the space of n-tuples x = {x\, . . . , x n ) of 
real numbers. As usual, the sum x + y of x, y £ R" is defined coordinatewise, 
so that the ith coordinate of x + y is the sum of the ith coordinates of x and 
y, 1 < i < n. If t £ R and x £ R™, then the scalar product tx is defined by 
putting its ith coordinate equal to txi. 

A norm on R™ is a nonnegative real- valued function N(x) defined for x £ R™ 
such that N(x) — if and only if x = 0, 

(2.1) N{x + y) < N{x) + N(y) 
for every x, y £ R™, and 

(2.2) N(tx) = \t\N(x) 
for every t £ R and x £ R™. In this case, 

(2.3) d(x,y)=N(x-y) 

defines a metric on R™. 

For example, the absolute value function \x\ is a norm on the real line. The 
standard Euclidean norm on R™ is defined by 



(2.4) M=(£0 



1/2 

3 = 1 



It is well known that this satisfies the triangle inequality, and hence is a norm. 
The corresponding metric is the standard Euclidean metric on R™. As another 
example, 

n 

(2.5) iwii = En 

is a norm on R". For any real number p > 1, it can be shown that 



(2-6) M P =(EN P ) V " 



is a norm on R n . This is the same as the standard Euclidean norm \x\ when 
p = 2, and the triangle inequality can be established using the convexity of the 
function \r\ p on R when p > 1. It is easy to check directly that 

(2.7) H^Hoo = max(|a;i|, . . . , \x n \) 

is a norm on R n . Because 

(2-8) Woo<W| p 

and 

(2.9) ||i|| P <n 1/p ||a:|| M) 
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when 1 < p < oo, \\x\\ p — > ||a;||oo as p ^ oo. 
If N is any norm on R" , then 

(2.10) N(x) < N(y) + N(x — y) 
and 

(2.11) N(y)<N(x)+N(x-y), 
and hence 

(2.12) |JV(a;) - JV(i,)| < JV(a; - i/) 

for every x,y G R™. This implies that N is continuous with respect to the 
metric associated to N. One can also check that N is bounded by a constant 
times the Euclidean norm on R™. It follows that N is continuous with respect 
to the standard Euclidean metric on R". 



3 The unit circle 

The unit circle S 1 is the set of x = (xi,x 2 ) G R 2 such that \x\ = 1, or 

(3.1) x\+x 2 2 = l. 

The restriction of the standard Euclidean metric \x—y \ on R 2 to x, y G S 1 defines 
a metric on S 1 , but there is another metric that is more intrinsic. Specifically, 
let d(x, y) be the length of the shorter arc connecting x to y in S 1 . This is the 
same as the angle at the origin between the line segments to x and y. Clearly 
d(x, y) > is symmetric in x and y, and is equal to exactly when x = y. The 
total length of the unit circle is 2ir, and hence 

(3.2) d(x,y)<n 
for every x, y G S 1 . Furthermore, 

(3.3) d(x,y)=w 

if and only if x and y are antipodal points in the circle, which means that 
y = —x. If x,y,z G S 1 , then the shorter arcs connecting x to y and y to z 
can be combined to get an arc between x and z, which implies that the triangle 
inequality holds. 

There is a simple relationship between d(x,y) and \x — y\, which is that 

(3.4) sin(^)=tM. 
In particular, 

(3.5) \x-y\< d(x, y) < - \x - y\ 

for every x, y G S 1 . This can be improved when \x — y\ is small, since 

„n sinr 

(3.6) lim = 1. 

r^O r 
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Specifically, for every e > there is a S > such that 

(3.7) d(x,y)<(l + e)\x-y\ 

when \x — y\ < S. 
Put 

(3.8) e(t) = (cost, sin t) 

for each t G R. This defines a mapping from the real line onto the unit circle 
that satisfies 

(3.9) e'(t) = — e(t) = (sint, - cost). 
Hence 

(3.10) |e'(t)| = l 

for every t G R, which means that the length of the arc traced by e(t) for 
a < t < b is equal to b — a for every a, b G R with a < b. Thus 

(3.11) d(e(a),e(b)) = b-a 
when b — a < n. 



4 A little geometry 

Let P be a plane in R". If x G R" and x $ P, then there is a unique point 
x' G P such that the line in R" passing through x and x' is perpendicular to 
P. If x £ P, then put x' = x. 
For each p E P, 

(4.1) |x-p| 2 = |x-x'| 2 + |x'-p| 2 . 

If r > |x — x'|, then 

(4.2) \x-p\=r 
is equivalent to 

(4.3) \x'-p\=¥, =r 2 - \x-x'\ 2 

for p G P. 

Fix q G P and t > 0, and consider 

(4.4) {pGP: \p-q\ =t}. 

The maximum and minimum of \x' — p| on this set occur on the line through x' 
and q when x' 7^ The maximum and minimum of \x — p\ on this set occur at 
the same points. 
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5 The unit sphere 



For each positive integer n, let S™ -1 be the unit sphere in R n , consisting of the 
x G R™ such that |a;| = 1. This is the unit circle when n = 2, and it contains 
only the two elements ±1 when n = 1. Let us focus now on the case where 
n > 3. As before, the restriction of the Euclidean metric \x — y\ to x, y G S™ _1 
defines a metric on S™ _1 . The spherical metric d(x,y) can be defined be the 
conditions 

(5.1) 0<d(x,y)<7r 
and 

(5.2) sin (^)=^ 

for every x,y G S™ -1 . This is symmetric in x and y, and equal to exactly 
when x = y. In order to show that the triangle inequality holds, we would like 
to reduce to the case of the unit circle. 

Let Q be a two-dimensional plane in R™ passing through the origin. The 
intersection of Q with S™ -1 is a circle of radius 1, also known as a great circle 
in S" -1 . We can think of Q n S™ -1 as a copy of the unit circle, and d(x,y) 
for x, y G Q fl S" -1 corresponds exactly to the metric defined previously on S 1 . 
Thus d(x,y) also satisfies the triangle inequality on Q n S" _1 . 

Let x,y,z G S n_1 be given, and let us show that 

(5.3) d(x,z) < d(x,y) + d(y, z). 

This is trivial when x = y, and so we may suppose that x ^ y. Let Q be the two- 
dimensional plane in R™ passing through x, y, and the origin. Let A be the set 
of w G S" _1 such that \y — w\ = \y — z\, which is equivalent to d(y, w) — d(y, z). 
This is an (n — 2)-dimcnsional sphere contained in S"" 1 , except for the trivial 
cases where z = y or z = — y and A contains only z. We can also describe A 
as the intersection of S" _1 with a certain hyperplane H in R" perpendicular 
to the line L through y and 0. Using geometric arguments as in the previous 
section, one can show that \x — w\ is maximized on A at a point w G A HQ. 
Hence d(x,w) is maximized at the same point w . We also have that 

(5.4) d(x,w Q ) < d(x,y) +d(y,w ), 

because x, y, and wo are contained in the same great circle Q PI S™ _1 . This 
implies that the triangle inequality holds for x, y, and z, since d(y, z) = d(y 7 u> ) 
by definition of A and d(x, z) < d(x, wo) by maximization. 
As in the case of the unit circle, 

(5.5) d(x,y)=n 

exactly when x and y are antipodal points in S n_1 , which means that y = —x. 
This is also equivalent to saying that x and y are contained in the same line 
passing through 0. For every x,y G S n_1 , 

(5.6) \x-y\ < d(x,y) < | \x-y\, 
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and d(x, y) is approximately the same as \x — y\ when x and y are close together, 
in the sense that for every e > there is a S > such that 



(5.7) 



d(x,y) < (1 + e) \x - y 



when \x — y\ < 5. 

6 Supremum and infimum 

A real number b is said to be an upper bound for a set A C R if a < b for every 
a e A, and a real number c is said to be a lower bound for A if c < a for every 
a G A Note that A has both an upper and lower bound in R if and only if A 
is bounded with respect to the standard metric on R. 

A real number a is said to be the least upper bound or supremum of a set 
A C R if a is an upper bound for A, and if a < b for every upper bound b of A. If 
a, a' <E R both satisfy these conditions, then it follows that a < a' and a' < a, 
and hence a = a'. The completeness property of the real numbers states that 
a nonempty set A C R with an upper bound has a least upper bound, which 
is unique by the previous remark. The supremum of A is denoted sup A when 
it exists. If A has only finitely many elements, then the supremum of A is the 
same as the maximum of the elements of A. Otherwise, the supremum may not 
be an element of A. For example, if A is the set of all negative real numbers, 
then sup A — is not an element of A. 

The greatest lower bound or infimum of a set A C R is defined analogously 
as a lower bound for A which is greater than or equal to any other lower bound 
of A. The infimum of A is unique when it exists, in which case it is denoted 
inf A. It follows from the completeness property of the real numbers that A has 
an infimum when A ^ has a lower bound. Specifically, the infimum of A can 
be obtained as the supremum of the set of lower bounds for A. Alternatively, 
the infimum of A is the negative of the supremum of — A = {—a : a G A}. 

A set £ in a metric space (M,d(x,y)) is bounded if and only if the set of 
real numbers d(x,y), x,y G E, has an upper bound. If E C M is bounded and 
nonempty, then the diameter diam E of E is defined by 

(6.1) diami? = sup{rf(a;, y) : x, y G E}. 

7 Lipschitz mappings 

Let (M,d(x,yj) and (N,p(u 7 v)) be metric spaces. A mapping / : M — > N is 
said to be Lipschitz with constant C > if 



for every x, y G M. We may also simply say that / is C-Lipschitz in this case. 
Thus a mapping is O-Lipschitz if and only if it is constant. 



(7.1) 



p(f(x)J(y))<Cd(x,y) 
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A mapping / : M — > R is C-Lipschitz with respect to the standard metric 
on the real line if and only if 

(7.2) f(x)<f(y) + Cd(x,y) 

for every x,y £ M. This follows by interchanging the roles of x and y. In 
particular, f p (x) = d(p, x) is 1-Lipschitz for every p s M. 

Lipschitz mappings are automatically uniformly continuous. If / is a C- 
Lipschitz mapping from M into N and E C M is nonempty and bounded, 
then 

(7.3) diamjv/(S) < Cdiam M -E, 

where the subscripts indicate in which metric space the diameter is taken. 

Let (Mi.di), (M 2 ,d 2 ), and (M 3 ,d 3 ) be metric spaces. If /i : Mi -> M 2 and 
/ 2 : M 2 — > M 3 are Lipschitz mappings with constants Ci,C 2 > 0, respectively, 
then the composition / 2 o /1 : Mi — > M 3 defined by (/ 2 fi){ x ) = /2(/i(^)) is 
Lipschitz with constant Ci C 2 . 

8 Lengths of curves 

Let (M, d(x,y)) be a metric space, and let a, b be real numbers with a < b. 
The closed interval [a, b] is defined as usual as the set of real numbers t such 
that a < t < b. Let p : [a, 6] — + M be a continuous mapping, which is to say a 
continuous path in M defined on [a, b]. 

A partition V of [o, b] is a finite sequence {tj}™ = o °^ rea ^ numbers such that 

(8.1) a = t < *i < • • • < t n = b. 
For each such partition V, put 

n 

(8.2) A 7J = ^d(p(t J -),p(ti-i)). 

This is an approximation to the length of p(t), a < t < b. If n = 1, then 

(8.3) At> = d(p(a),p(6)). 

We say that p has /imfe length if the numbers A-p have an upper bound, 
uniformly over all partitions V of [a, 6]. In this case, the length of p is denoted 
A and defined to be the supremum of the Ap's. Thus 

(8.4) d(p(a),p(bj) < A. 

Similarly, one can show that p([a,b]) is a bounded set in M when p has finite 
length, and that 

(8.5) diamp([a, b]) < A. 

The condition of finite length is already nontrivial when M is the real line 
equipped with the standard metric, for which it is known classically as bounded 
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variation. The length of a real-valued function is also known as the total varia- 
tion. If p : [a, b] — ► R is monotone increasing, then p has bounded variation, and 
the total variation of p is equal to p(b) — p{a). However, one can give examples 
of continuous real- valued functions on closed intervals that do not have bounded 
variation. 



9 Special cases 

Let (M,d(x,y)) be a metric space, and let a, b be real numbers with a < b. 
We can think of [a, b] as being equipped with the restriction of the standard 
metric on the real line. If p : [a, b] — > M is C-Lipschitz for some C > 0, 
then A-p < C (b — a) for every partition V of [a, b\. Thus p has finite length 
A<C(b-a). 

Suppose that M is R™ with the standard Euclidean metric, and that p : 
[a, b] — > R n is continuously differentiablc. The fundamental theorem of calculus 
implies that 

(9.1) P(t)-p(r) = / p'(u)du 



when a < r < t < b. Because p'(u) is continuous on [a, b], \p'(u)\ is bounded on 
[a, 6], and p is Lipschitz on [a, b}. It follows that p has finite length on [a, b}. 
In this case, the length A of p on [a, b] is given by 

(9.2) A= f \p'(u)\du. 

J a 

For if V is any partition of [a, b], then the previous formula implies that 

(9.3) A r < I \p'{u)\du. 

J a 

Hence 

(9.4) A < I \p'{u)\du. 

J a 

To get the oppposite inequality, one can use uniform continuity of p' on [a, b] to 
approximate the Riemann sums of the integral by A-p's. 

There are analogous statements for the metric djv associated to a norm N 
on R™. Any norm on R" is bounded by a constant multiple of the standard 
norm, which implies that a continuously-differentiable curve p : [a, b] — > R™ is 
also Lipschitz with respect to the metric dpj on R™, and thus has finite length. 
The norm of the integral of a continuous R™-valued function is less than or 
equal to the integral of the norm of the function, as in the case of the Euclidean 
norm, and hence 

(9.5) N(p(r)-p(t)) < f N{p'(u))du 

J r 
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when a < r < t < b. This implies that 

(9.6) Ap < [ N{p'{u))du 

J a 

for any partition V of [a, 6], where Ap is now the approximation to the length 
of the curve corresponding to the metric c?at. Therefore 

(9.7) A< f N(p'{u))du, 

J a 

and one can get the opposite inequality to conclude that 

(9.8) A= [ N(p'(u))du 

J a 

in the same way as for the Euclidean norm. 



10 Compositions 

Let (M,d(x,y)) and (N, p(u,v)) be metric spaces, and suppose that / is a 
Lipschitz mapping from M to N with constant C > 0. If p is a continuous 
mapping from [a, b] into M, then the composition fop is a continuous mapping 
from [a, b] into M. For each partition V of [a, b], the analogue of A-p for fop 
is bounded by C times A-p for p. It follows that fop has finite length if p has 
finite length, and that the length of / op is bounded by C times the length of p. 

Suppose that a, (3 are also real numbers such that a < (3, and that (f> is a 
one-to-one continuous mapping of [a, /3] onto [a, 6]. Thus maps every partition 
V of [a, (3] to a partition ^(P) of [a, 6], and every partition of [a, 6] is of the form 
4>(V) for some partition V of [a, /?]. The approximation to the length of p o 
associated to a partition "P of [a, /?] is equal to the approximation to the length 
of p associated to the corresponding partition 4>{V) of [a, b]. It follows that 
p : [a, b] — > M has finite length if and only if p o <f> : [a, f3] — > M does, in which 
event the lengths are the same. 

Now suppose that is a monotone increasing continuous mapping from 
[a,0\ onto [a, b\. If a < p < t < /3 and (/>(/?) = 0(r), then is constant on 
[p, t]. Although the correspondence between partitions of [a, [3] and [a, b] is a 
little more complicated, the approximations to the lengths of p and p o <fi still 
match up, because the intervals on which <j> is constant only add terms equal to 
to the approximations to the length of po<j). Consequently, p has finite length 
if and only if p o <fi does, and the lengths are again the same. 

One can also consider the composition p o <f> when is a continuous mapping 
from [a, f3] onto [a, b] which may not be monotone. If p o has finite length, 
then one can check that p has finite length, and that the length of p is less than 
or equal to the length of p o <f>. It is easy to have strict inequality, because (j> 
may retrace parts of [a, b] more than once. For example, if M is the real line 
and p is the identity mapping, then 4> = p o <j> may have unbounded variation or 
total variation strictly greater than b — a. 
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11 Refinements and subintervals 



Let (M,d(x,y)) be a metric space, and let p be a continuous mapping from a 
closed interval [a, b] in the real line into M. A partition Vi of [a, 6] is said to be 
a refinement of another partition Pi of [a, 6] if each point in V\ is also in Vi- 
If A-pj , Ap 2 are the corresponding approximations to the length of p, then one 
can use the triangle inequality to show that 

(11.1) A Vl <A V2 . 

Note that for every pair of partitions V, V of [a, b], there is a partition V" of 
[a, b] which is a refinement of both V and V' . 

If ai, b\ are real numbers such that a < a\ < b\ < b, then every partition 
of [ai,6i] can be extended to a partition of [a, b]. The approximation to the 
length of p on [ai,6i] corresponding to the first partition is less than or equal 
to the approximation to the length of p on [a, b] that corresponds to the second 
partition. If p has finite length on [a,b], then it follows that the restriction of 
p to [ai,&i] has finite length less than or equal to the length of p on [a, b]. Let 
the length of p on [a±, b{\ be denoted A(a l7 b\), so that the previous statement 
is expressed by the inequality 

(11.2) A(ai,6i) < A(a,b). 

For each r e [a,b], 

(11.3) A(a,r)+A(r,b)=A(a,b). 

Any partitions of [a,r] and [r,b] can be combined to get a partition of [a, b], 
and the sum of the corresponding approximations to A(a,r) and A(r, b) is an 
approximation to A(a, 6), which implies that A(a, r) + A(r, 6) < A(a, 6). Every 
partition of [a, 6] has a refinement of this form, which implies the opposite 
inequality. The same argument shows that p has finite length on [a, b] when its 
restrictions to [a, r] and [r, b] have finite length. 

As a function of r on [a, b], A(a, r) is monotone increasing, and one can also 
show that A(a,r) is continuous. It suffices to check that A(a,r) is continuous 
from the left, and that A(r, b) is continuous from the right. To do this, it 
is helpful to consider partitions of [a, r] and [r, b] for which the corresponding 
approximations to the length are close to the supremum, and to use the previous 
remarks about refinements of partitions. The continuity of p is also important 
here, to limit the effect of a term in the sums involving r. 

12 Curves of minimal length 

Let (M,d(x,y)) be a metric space, and suppose that p : [a,b] — > M is a con- 
tinuous curve of finite length. If A(a,r) is the length of the restriction of p 
to [a, r], then there is a continuous mapping q : [0, A(a, &)] — > M such that 
q(A(a,rj) = p(r). The main point is that p(r) is constant on any interval on 
which A(a, r) is constant, so that q is well-defined. Moreover, q is Lipschitz with 
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constant 1 , because the distance between the endpoints of a curve is less than 
or equal to the length of the curve. 

If closed and bounded subsets of M are compact, then there is a continuous 
curve of minimal length connecting any pair of elements of M for which there 
is a continuous curve of finite length. Using the observation in the preceding 
paragraph and a linear change of variables on the real line, it is enough to show 
that there is a Lipschitz mapping from the unit interval [0, 1] into M connecting 
the two points whose Lipschitz constant is as small as possible. One starts with 
a sequence of Lipschitz mappings from [0, 1] into M connecting the two points 
whose Lipschitz constants tend to the infimum. The Arzela-Ascoli theorem 
implies that a subsequence of this sequence converges uniformly to a Lipschitz 
mapping on [0, 1] with minimal Lipschitz constant. 

For example, line segments in R n yield paths of minimal length for the stan- 
dard metric, or for the metric associated to any norm. In any metric space, the 
length of a path is greater than or equal to the distance between the endpoints 
of the path. If the length of the path is equal to the distance between the end- 
points, then the path automatically has minimal length. This is exactly what 
happens for line segments in R™ with respect to any norm, although for some 
norms like || • 1^ and || • there are other paths of minimal length too. 

A curve in the unit sphere in R™ has finite length with respect to the Eu- 
clidean metric if and only if it has finite length with respect to the spherical 
metric, in which case the length of the curve is the same for both metrics. This 
is because the two metrics are approximately the same locally in such a precise 
way, and because one can use refinements of partitions to make sure that only 
local distances are used to determine the length of a path. An arc of a great 
circle in S™ -1 with length < ir has minimal length with respect to the spherical 
metric, because the distance between its endpoints is equal to its length. Such 
an arc therefore has minimal length with respect to the Euclidean metric as 
well. 
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